INTERNATIONAL APPL. NO. INT. FILING DATE PRIORITY DATE CLAIMED 

PCT/US2004/037764 1 2 November 2004 1 2 November 2003 



Amendments to the Claims 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

We claim: 

1. (Original) A kit containing: 

i) a scraping instrument for collecting a biological sample, comprising: 

a) a proximal handle end; 

b) a distal collection end; and 

c) a joining portion between the handle end and the collection end; 

wherein the joining portion is generally continuous in width with the handle end and 
the collection end on either side of the joining portion; and the joining portion allows the 
handle end and the collection end to be optionally detached from each other; and 

wherein the collection end further comprises a peripheral edge and a depression, 
wherein at least some of the peripheral edge of said collection portion is serrated to allow 
scraping of the biological sample, and the depression allows the scraped biological sample to 
be collected; 

ii) a storage vessel; and 

iii) a stabilizing solution. 

2. (ORIGINAL) The kit of claim 1, wherein said collection end is spoon shaped. 

3. (ORIGINAL) The kit of claim 1, wherein the instrument comprises plastic. 

4. (ORIGINAL) The kit of claim 1, wherein the joining portion comprises a perforation. 

5. (ORIGINAL) The kit of claim 1, wherein the length of the instrument from about the 
proximal end of the handle end to the distal end of the collection end is about 3-6 inches. 

6. (Canceled) 

7. (ORIGINAL) The kit of claim 1, wherein the length and the width of the collection end 
allow the collection end to fit into a storage vessel. 

8. (Canceled) 

9. (Canceled) 

10. (Canceled) 

1 1 . (Canceled) 

12. (Canceled) 

13. (Currently AMENDED) An RNA collection system, comprising: 
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(a) a scraping instrum e nt having a proximal handle end, a distal collection end 

comprising a serrated periph e ral e dge, and a joining portion between the handle end and the 
collection end, the joining portion allows the handle end and the coll e ction end to b e 
optionally d e tach e d from e ach oth e r; and 

(b) a storag e vessel comprising an RNA stabilization solution. The kit of claim K 

wherein the scraping instrument is used for collecting RNA and the stabilizing solution is an 
RNA stabilization solution. 

14. (ORIGINAL) The kit of claim 13, wherein the storage vessel contains a lid. 

15. (Canceled) 

16. (Canceled) 

17. (ORIGINAL) A non-invasive method for obtaining isolated nucleic acid from mouth 
epithelial cells, comprising: 

(a) transferring non-invasively isolated cells from a subject's mouth to a nucleic 
acid stabilization solution that inactivates nucleases, and 

(b) extracting the nucleic acid of interest from the isolated cells, to obtain an 
isolated nucleic acid sample. 

18. (Canceled) 

19. (Canceled) 

20. (ORIGINAL) The method of claim 17, wherein the nucleic acid is RNA. 

21. (ORIGINAL) The method of claim 17, wherein the cells are isolated non-invasively 
from the mouth by scraping with a scraping instrument. 

22. (ORIGINAL) The method of claim 21, wherein the scraping instrument is a plastic tool 
capable of collecting a large number of epithelial cells from buccal mucosa in relatively non- 
invasive fashion, wherein the plastic tool comprises a serrated edge to scrape off several 
layers of epithelial cells, and a curved surface to collect those cells. 

23. (ORIGINAL) The method of claim 20, wherein the sample of scraped cells in the RNA 
stabilization solution is stored at -15 to -25° C prior to extraction of the RNA from the sample. 

24. (ORIGINAL) The method of claim 23, wherein the RNA stabilization solution is 
RNALater RNA stabilization reagent. 

25. (Canceled) 

25. (Original) The method of claim 24, wherein the gene of interest is expressed in 
subjects who have lung cancer and not expressed in subjects who do not have lung cancer. 

26. (Canceled) 

27. (Canceled) 

28. (Canceled) 
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29. (Canceled) 

30. (Canceled) 

3 1 . (CURRENTLY AMENDED) A method of determining whether an individual is at 
increased risk of developing a lung disease, comprising: 

a) taking a biological sample from the mouth of an individual exposed to an 
airway pollutant or at risk of being exposed to an airway pollutant; and 

b) analyzing whether there is a genetic alteration in at least one gene of the 
mouth transcriptome genes of claim 29 the group consisting of ABCC1; ABHD2; 
AF333388.1: AGTPBP1: AIP1: AKR1B10AKR1C1: AKR1C2; AL1 17536.1: AL353759; 
ALDH3A1; ANXA3: APLP2: ARHE: ARL1; ARPC3: ASM3A: B4GALT5: BECN1: 
Clorf8; C20orfl 11; C5orf6: C6orf80: CA12: CABYR: CANX: CAP1: CCNG2; CEACAM5: 
CEACAM6; CED-6; CHP: CHST4: CKB: CLDN10: CNK1: CQPB2: CQX5A; CPNE3; 
CRYM: CSTA: CTGF: CYP1B1; CYP2A6: CYP4F3: DEFB1: DIAPH2: DKFZP434J214; 
DKFZP564K0822: DKFZP566E144: DSCR5; DSG2: EPAS1: EPOR; FKBP1A; FLJ10134; 
FLJ13052: FLJ130521; FLJ20359; FMQ2: FTH1; GALNT1: GALNT3: GALNT7; GCLC: 
GCLM: GGA1: GHITM; GMDS: GNE: GPX2; GRP58; GSN; GSTM3: GSTM5: 
GUKkHIGl: HIST1H2BK: HN1: HPGD: HRIHFB2122: HSPA2: IDH1: IDS: IMPA2; 
ITM2A; JTB; KATNB1: KDELR3: KIAA0397: KIAAQ9Q5:KLF4: KRT14; KRT15; 
LAMP2:LOC51186: LQC57228: LQC92482: LOC92689: LYPLA1; MAFG; ME1; 
MGC4342: MGLL: MT1E: MT1F; MT1G: MT1H: MT1X; MT2A; NCOR2; NKX3-1: 
NOOl: NUDT4; ORL1; P4HB; PEX14; PGP: PRDX1; PRDX4; PSMB5: PSMD14; 
PTP4A1: PTS;RAB11A;RAB2; RAB7: RAP1GA1: RNP24; RPN2:S100A10: S100A14; 
S100P: SCP2; SDR1: SHARP1; SLC17A5: SLC35A3: SORD: SPINT2: SOSTM1; SRPUL; 
SSR4; TACSTD2: TALDQ1: TARS: TCF7L1: TIAM1: TJP2; TLE1; TM4SF1; TM4SF13: 
TMP21; TNFSF13: TNS: TRA1; TRIM16; TXN: TXNDC5; TXNL; TXNRD1: UBE2J1: 
UFD1L; UGT1A10; YF13H12; and ZNF463 , wherein the presence of a genetic alteration in 
one or more of the mouth transcriptome genes as compared to the same at least one gene in a 
group of control individuals is indicative that the individual has an increased risk of 
developing a lung disease. 

32. (ORIGINAL) The method of claim 31, wherein the genetic alteration is selected from 
the group consisting of deviation of a gene's DNA methylation pattern and deviation of a 
gene's expression pattern. 

33. (ORIGINAL) The method of claim 32, wherein the genetic alteration is a deviation of 
a gene's expression pattern. 
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34. (ORIGINAL) The method of claim 33, wherein the air pollutant is smoke from a 
cigarette or a cigar and the lung disease is lung cancer. 

35. (ORIGINAL) The method of claim 34, wherein the lung cancer is selected from 
adenocarcinoma, squamous cell carcinoma, small cell carcinoma, large cell carcinoma, and 
benign neoplasms of the lung. 

36. (CURRENTLY AMENDED) The method of claim 31 or 35 31, wherein the individual is 
a smoker and one looks at expression of at least one gene selected from the group consisting 
of mouth transcriptome genes, wherein lower expression of that at least one gene in the 
smoker than in a control group of corresponding smokers is indicative of an increased risk of 
developing lung cancer. 

37. (ORIGINAL) The method of claim 36, wherein lower expression of at least three 
genes of the mouth transcriptome is indicative of an increased risk of developing lung cancer. 

38. (canceled) 

39. (Canceled) 

40. (Canceled) 

41. (Canceled) 

42. (Canceled) 

43. (Canceled) 

44. (ORIGINAL) A method of diagnosing predisposition of a smoker to lung disease 
comprising analyzing an expression pattern of one or more genes selected from the group 
consisting of ABCC1; ABHD2; AF333388.1; AGTPBP1; AIP1; AKR1B10AKR1C1; 
AKR1C2; AL1 17536.1; AL353759; ALDH3A1; ANXA3; APLP2; ARHE; ARJL1; ARPC3; 
ASM3A; B4GALT5; BECN1; ClorfS; C20orfl 11; C5orf6; C6orf80; CA12; CABYR; 
CANX; CAP1; CCNG2; CEACAM5; CEACAM6; CED-6; CHP; CHST4; CKB; CLDN10; 
CNK1; COPB2; COX5A; CPNE3; CRYM; CSTA; CTGF; CYP1B1; CYP2A6; CYP4F3; 
DEFB1; DIAPH2; DKFZP434J214; DKFZP564K0822; DKFZP566E144; DSCR5; DSG2; 
EPAS1; EPOR; FKBP1A; FLJ10134; FLJ13052; FLJ130521; FLJ20359; FM02; FTH1; 
GALNT1; GALNT3; GALNT7; GCLC; GCLM; GGA1; GHITM; GMDS; GNE; GPX2; 
GRP58; GSN; GSTM3; GSTM5; GUK1;HIG1; HIST1H2BK; HN1; HPGD; HRIHFB2122; 
HSPA2; IDH1; IDS; IMPA2; ITM2A; JTB; KATNB1; KDELR3; KIAA0397; 
KIAA0905;KLF4; KRT14; KRT15; LAMP2;LOC51 186; LOC57228; LOC92482; 
LOC92689; LYPLA1; MAFG; ME1; MGC4342; MGLL; MT1E; MT1F; MT1G; MT1H; 
MT1X; MT2A; NCOR2; NKX3-1; NQOl; NUDT4; ORL1; P4HB; PEX14; PGD; PRDX1; 
PRDX4; PSMB5; PSMD14; PTP4A1; PTS;RAB1 1 A;RAB2; RAB7; RAP1GA1; RNP24; 
RPN2;S100A10; S100A14; S100P; SCP2; SDR1; SHARP1; SLC17A5; SLC35A3; SORD; 
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SPINT2; SQSTM1; SRPUL; SSR4; TACSTD2; TALDOl; TARS; TCF7L1; TIAM1; TJP2; 
TLE1; TM4SF1; TM4SF13; TMP21; TNFSF13; TNS; TRA1; TRIM 16; TXN; TXNDC5; 
TXNL; TXNRD1; UBE2J1; UFD1L; UGT1A10; YF13H12; and ZNF463 in a biological 
sample taken from the mouth of the smoker, wherein a divergent expression pattern of one or 
more of these genes as compared to the expression pattern of these genes in group of control 
individuals is indicative of the predisposition of the individual to lung disease. 

45. (CURRENTLY AMENDED) A The method of diagnosing predisposition of a smok e r to 
lung dis e as e claim 44, comprising analyzing an expression pattern of one or more genes 
selected from the group consisting of AGTPBP1; AKR1C1; AKR1C2; ALDH3A1; ANXA3; 
CA12; CEACAM6; CLDN10; CYP1B1; DPYSL3; FLJ13052; FTH1; GALNT3; GALNT7; 
GCLC; GCLM; GMDS; GPX2; HN1; HSPA2; MAFG; ME1; MGLL; MMP10; MT1F; 
MT1G; MT1X; NQOl; NUDT4; PGD; PRDX1; PRDX4; RAB1 1A; S100A10; SDR1; 
SRPUL; TALDOl; TARS; TCF-3; TRA1; TRIM 16; and TXN in a biological sample taken 
from th e mouth of th e smoker, wherein a div e rg e nt e xpr e ssion pattern of on e or mor e of th e s e 
gen e s as compar e d to th e expression patt e rn of th e se g e n e s in group of control individuals is 
indicative of th e pr e disposition of the individual to lung dis e as e. 

46. (Canceled) 

47. (Canceled) 

48. (Canceled) 

49. (CURRENTLY AMENDED) The method of any of claims 31 18 44, wherein the 
biological sample is a nucleic acid sample. 

50. (CURRENTLY AMENDED) The method of claim 49, wherein the nucleic acid is RNA 
or DNA . 

51. (ORIGINAL) The method of claims 50, wherein the analysis is performed using a 
nucleic acid array. 

52. (Original) The method of claim 50, wherein the analysis is performed using 
quantitative real time PCR or mass spectrometry. 
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